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(54) Processor for executing highly efficient VLIW instructions 


(57) A 32-bit instruction 50 is composed of a 4-bit 
format field 51 , a 4-bit operation field 52, and two 1 2-bit 
operation fields 59 and 60. The 4-bit operation field 52 
can only include (1 ) an operation code "cc" that indicates 


a branch operation which uses a stored value of the im- 
plicitly indicated constant register 36 as the branch ad- 
dress, or (2) a constant 'const". The content of the 4-bit 
operation field 52 is specified by a format code provided 
in the format field 51 . 
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Description 

BACKGROUND OF THE INVENTION 

(1) Field of the Invention 

The present invention relates to a processor with 
VLIW (Very Long Instruction Word) architecture, and in 
particular to a processor that executes instructions with 
comparatively short word length and high code efficien- 
cy. 

(2) Description of the Prior Art 

With the increase in demand for multimedia devices 
and the miniaturization of electronic circuits in recent 
years, there has been a growing need for microproces- 
sors that can process multimedia data, such as audio 
data and image data, at high speed. One kind of proc- 
essors that are capable of meeting this need are proc- 
essors that use VLIW architecture, these being herein- 
after referred to as "VLIW processors". 

VLIW processors include a number of internal op- 
eration units and so are able to simultaneously execute 
a number of operations in one VLIW in parallel. Such 
VLIW are generated by a compiler that investigates the 
extent to which parallel processing is possible at the 
source program level and performs scheduling. For em- 
bedded microprocessors used in consumer appliances, 
however, it is important to suppress the code size of pro- 
grams, so that 256-bit VLIW, with their high incidence of 
no-operation instructions (hereinafter referred to as 
"NOP instructions") and resulting poor code efficiency, 
are far from ideal. 

One example of a VLIW processor that executes 
instructions with relative short word length is Japanese 
Laid-Open Patent Application H09-26878. This tech- 
nique teaches a data processing apparatus that is a 
VLIW processor for executing 32-bit instructions that 
can simultaneously indicate a maximum of two opera- 
tions. 

Figs. 1 A and 1B show the instruction format of the 
stated technique, with Fig. 1 A showing the instruction 
format for simultaneously indicating two operations and 
Fig. 1 B showing the instruction format for indicating only 
one operation. This technique aims to improve code ef- 
ficiency by including a 2-bit value in the format field 41 0 
that shows the number of operations in each instruction 
and the execution order. 

The indication of a maximum of two operations by 
a single 32-bit instruction, however, does not achieve a 
sufficient degree of parallelism. There is also the prob- 
lem of decreases in code efficiency of instructions when 
performing an operation using a constant that exceeds 
a given word length. As one example, when a 32-bit con- 
stant is split into an upper 16 bits and a lower 16 bits so 
that it can be set into registers, two 32 -bit instructions 
are required just to indicate an operation using this con- 


stant. 

SUMMARY OF THE INVENTION 

s in view of the stated problems, it is a first object of 
the present invention to provide a VLIW processor that 
executes instructions of comparatively short word 
length, but which have a high degree of parallelism and 
a highly efficient code structure so that several opera- 
tions can be simultaneously indicated. As one example, 
three or more operations can be indicated by a single 
32-bit instruction. 

it is a second object of the present invention to pro- 
vide a VLIW processor for executing instructions of a 
comparatively short word length that have a structure 
whereby the overall code efficiency will be comparative- 
ly unaffected even when handling constants of compar- 
atively long word length. 

The first object can be realized by a VLIW (Very 
Long Instruction Word) processor that decodes and ex- 
ecutes an instruction that has at least two operation 
fields, of which a first operation field can only include 
one operation code for specifying an operation type and 
a second operation field includes a combination of one 
operation code and at least one operand used in an op- 
eration indicated by the second operation field, the 
VLIW processor including: a first decoding unit for de- 
coding the operation code in the first operation field; a 
first execution unit for executing an operation indicated 
by the operation code in the first operation field in ac- 
cordance with a decoding result of the first decoding 
unit; a second decoding unit for decoding the operation 
code in the second operation field; and a second exe- 
cution unit for executing the operation indicated by the 
operation code in the second operation field on data 
which is indicated by the operands in the second oper- 
ation field, in accordance with a decoding result of the 
second decoding unit. 

By doing so, since at least one operation in the in- 
struction can be indicated by merely inserting an oper- 
ation code without an explicit indication of an operand, 
the word length of instructions can be reduced. As a re- 
sult, a VLIW processor that executes instructions of 
comparatively short word length, but which have a high- 
ly efficient code structure so that several operations can 
be simultaneously indicated is achieved. 

Here, a number of bits occupied by the operation 
code in the first operation field may be equal to a number 
of bits occupied by the operation code in the second op- 
eration field. 

As a result, all operation codes that are included in 
an instruction will be composed of the same number of 
bits, which simplifies components such as the decoder 
circuits. 

Here, the instruction may include three operation 
fields, wherein a third operation field in the three oper- 
ation fields may occupy a same number of bits as the 
second operation field and may include a combination 
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of one operation code and at least one operand, the 
VLIW processor further including: a third decoding unit 
which decodes, when an operation code is present in 
the third operation field, the operation code in the third 
operation field; and a third executing unit for executing 
an operation indicated by the operation code in the third 
operation field on data which is indicated by the oper- 
ands in the third operation field, in accordance with a 
decoding result of the third decoding unit. 

As a result, a VLIW processor with a high degree of 
parallelism whereby three operations can be simultane- 
ously performed can be achieved. 

Here, the first executing unit may control a control 
flow of a program including the instruction. 

As a result, branch operations which do not normal- 
ly require a large number of bits can be assigned to a 
short operation field. This means an instruction set with 
high code efficiency can be defined. 

Here, the second executing unit may control trans- 
fer of the data that is indicated by the operands included 
in the second operation field, and the third executing unit 
may control executes an arithmetic logic operation on 
the data that is indicated by the operands included in 
the third operation field. 

As a result, data transfer to and from an external 
memory can be indicated by a single operation in an in- 
struction, so that the operand access circuit that should 
be provided in a VLIW processor can be simplified. 

The second object of the present invention can be 
achieved by a VLIW processor that decodes and exe- 
cutes an instruction that has at least two operation fields, 
of which a first operation field can only include one of (i) 
a single operation code for specifying an operation type 
and (ii) a constant, and a second operation field includes 
one of (i) a combination of one operation code and at 
least one operand used in an operation indicated by the 
second operation field and (ii) a constant, the VLIW 
processor including: a first decoding unit which de- 
codes, when an operation code is present in the first op- 
eration field, the operation code in the first operation 
field; a first executing unit for executing an operation in- 
dicated by the operation code in the first operation field, 
in accordance with a decoding result of the first decod- 
ing unit; a second decoding unit which decodes, when 
an operation code is present in the second operation 
field, the operation code in the second operation field; 
and a second executing unit for executing an operation 
indicated by the operation code in the second operation 
field on data which is indicated by the operands in the 
second operation field, in accordance with a decoding 
result of the second decoding unit. 

With the stated construction, when it is necessary 
to put meaningless code into an operation field in an in- 
struction, a constant that will be used by a different op- 
eration may instead be inserted, so that a VLIW proc- 
essor can be realized for executing instructions which 
have a high code efficiency despite having only a short 
word length. 


Here, the instruction also includes a format field in- 
cluding a format code indicating whether only a constant 
is located in the first operation field and whether only a 
constant is located in the second operation field, the 

5 VLIW processor further including: a format decoding 
unit for decoding the format code; and a constant stor- 
age unit for extracting, when a decoding result of the 
format decoding unit shows that only a constant is 
present in at least one of the first operation field and the 

10 second operation field, the constant in the instruction 
and storing the extracted constant. 

As a result, constants placed in an operation field 
can be stored in the constant storage unit for use by an 
operation in a later instruction, so that decreases in code 

is efficiency can be avoided even when handling con- 
stants of a comparatively long word length using instruc- 
tions of a comparatively short word length. 

Here, the format field, the first operation field, the 
operation code in the second operation field, each op- 

20 erand in the second operation field, the operation code 
in the third operation field, and each operand in the third 
operation field may each occupy n bits in the instruction. 

With the stated construction, all of the fields that 
compose an instruction have the same number of bits, 

25 which enables the internal circuits of the VLIW proces- 
sor to be simplified. 

Here, a VLIW processor may include: a fetch unit 
for fetching an L-bit instruction that includes n operation 
fields; and n operation units which are each associated 

30 with a different one of the n operation fields in the 
fetched instruction and each independently execute an 
operation indicated in the associated operation field in 
parallel with each other; the VLIW processor being char- 
acterized by the n operation fields not all being a same 

35 size, and by L not being an integer multiple of n. 

With the stated construction, there is no need for all 
of the operation fields in an instruction to have the same 
word length, making it possible to define instructions 
with high code efficiency. As a result, a VLIW processor 

40 that executes instructions of comparatively short word 
length, but which have a highly efficient code structure 
so that several operations can be simultaneously indi- 
cated is achieved. 

Here, n may be 3 and L may be 32. 

45 The stated construction realizes a VLIW processor 
with a high degree of parallelism whereby three opera- 
tions that are specified by a single 32-bit instruction can 
be simultaneously performed. 

Here, a number of operands included in at least one 

so operation field out of the n operation fields may be dif- 
ferent to a number of operands in other operation fields 
in the n operation fields. 

With the stated construction, there is no need for 
every operation field in an instruction to have the same 

55 number of operands, so that instruction formats with a 
high degree of code efficiency can be defined. 

Here, the n operation fields may include at least one 
operation field composed of only an operation code and 
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at least one operation field composed of an operation 
code and at least one operand. 

With the stated construction, the instruction word 
length is shorter than the case when every operation 
field in an instruction contains a combination of an op- 
eration code and operands, so that a VLIW processor 
that executes instructions which have a highly efficient 
code construction can be realized. 

As described above, the present invention realizes 
a VLIW processor that executes instructions of compar- 
atively short word length but which have a highly effi- 
cient code structure that allows several operations to be 
specified by a single instruction. This effect is especially 
noticeable for embedded processors that process mul- 
timedia data. 

BRIEF DESCRIPTION OF THE DRAWINGS 

These and other objects, advantages and features 
of the invention will become apparent from the following 
description thereof taken in conjunction with the accom- 
panying drawings which illustrate a specific embodi- 
ment of the invention. In the drawings: 

Fig. 1 A and 1 B show instruction formats used under 
the prior art, with Fig. 1 A showing an instruction for- 
mat where two operations are simultaneously indi- 
cated and Fig. 1B showing an instruction format 
where one only operation is indicated; 
Fig. 2A shows the field structure of an instruction 
that is executed by the processor of the present in- 
vention; 

Figs. 2B to 2D show sixteen types of instruction for- 
mat, with Fig. 2B showing triple operation instruc- 
tions, Fig. 2C showing twin operation instructions, 
and Fig. 2D showing single operation instructions; 
Fig. 3 is a table showing specific operations that are 
indicated by the three types of operation code, "cc", 
"opl", and "op2", that are used in Figs. 2B to 2D; 
Fig. 4 is a block diagram showing the hardware con- 
struction of the present processor; 
Fig. 5 is a block diagram showing the detailed con- 
struction of the constant register 36 of the present 
processor and the peripheral circuits; 
Figs. 6A to 6D are representations of different meth- 
ods for storing a constant by the constant register 
control unit 32 shown in Fig. 5, with Fig. 6A showing 
the case when the format code is •0" or "1 \ Fig. 6B 
showing the case when the format code is "4", Fig. 
6C showing the case when the format code is "5°, 
and Fig. 6D showing the case when the format code 
is '2", "3", or "A"; 

Fig. 7 is a block diagram showing the detailed con- 
struction of the PC unit 33 of the present processor; 
Fig. 8 is a flowchart showing a procedure that han- 
dles a 32-bit constant; 

Fig. 9 shows an example of a program that has the 
present processor execute the procedure shown in 


Fig. 8; 

Fig. 10 is a timing chart showing the operation of 
the present processor when executing the program 
shown in Fig. 9; 
5 Fig. 11 is an example of a program that has the 
present processor execute a procedure that han- 
dles a 16-bit constant; 

Fig. 12A shows the field definition of instructions 
that are executed by a standard processor; 
10 Fig. 1 2B shows the instruction format of the instruc- 
tions shown in Fig. 12 A; 

Fig. 1 3 shows an example of a program that has a 
standard processor perform the same procedure as 
the program shown in Fig. 9; 
is Fig. 1 4 shows an example of a program that has a 
standard processor execute the same procedure as 
the program shown in Fig. 11 ; 
Figs. 1 5 A to 1 5D show modifications to the structure 
o the instructions executed by the VLIW processor 
20 of the present invention; and 

Fig. 16 shows a modification to the hardware con- 
struction of the present processor to enable the ex- 
ecution of the instruction shown in Fig. 15A. 

25 DESCRIPTION OF THE PREFERRED EMBODIMENT 

An embodiment of the processor of the present in- 
vention is described below with reference to the figures, 
in this embodiment, the expression "instruction" refers 
30 to a set of code that is decoded and executed by the 
present processor simultaneously and in parallel, with 
the expression "operation" refers to a unit of processing, 
such as an arithmetic operation, a logic operation, a 
transfer, or a branch, which is executed by the present 
35 processor in parallel, as well as to the code which indi- 
cates each unit of processing. 

Instruction Format 

to First, the structure of the instructions that are de- 
coded and executed by the present processor will be 
described. The present processor is a VLIW processor 
that decodes and executes instructions with a fixed word 
length of 32 bits. 

45 Fig. 2A shows the field structure of an instruction 
50 to be executed by the present processor. Figs. 2B to 
2D, meanwhile, show sixteen instruction formats. Of 
these, the instruction formats in Fig. 2B simultaneously 
indicate three operations, the instruction formats in Fig. 

50 2C two operations, and the instruction formats in Fig. 
2D a single operation. 

This instruction 50 has a fixed word length of 32 bits 
and is composed of eight 4-bit physical fields shown in 
order starting from the MSB (Most Significant Bit) as 

55 po.O field 51, P1.0 field 52, ... P3.2 field 58 in Fig. 2A. 
Of these, the range from the P2.0 field 53 to the P2.2 
field 55 is called the first operation field 59, while the 
range from the P3.0 field 56 to the P3.2 field 58 is called 
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the second operation field 60. 

In Figs. 2Bto2D, the legend "const" indicates a con- 
stant, and depending on the operation in which it is used, 
this can be a numeric constant or a character constant 
such as an immediate, an absolute address, or a dis- s 
placement. The legend "op" represents an operation 
code that indicates an operation type, while the legend 
■Rs" indicates the register used as the source operand, 
"Rd" the register used as the destination operand, and 
"cc" an operation code indicating a branch operation 10 
that uses the stored value of a specialized 32-bit register 
provided in the present processor (the constant register 
36 shown in Fig. 4) as the absolute address or relative 
address (displacement) of a branch destination. 

The numerical values given directly after the codes is 
described above show values that are used in the oper- 
ation in either the first operation field 59 or the second 
operation field 60. As one example, for the instruction 
format with the format code "6", the 4-bit constant "con- 
stP located in the P1.0 field 52 and the 4-bit constant 20 
"constl" located in the P2. 1 field 54 are combined to form 
an 8-bit constant that is the source operand correspond- 
ing to the operation code "op1 " of the first operation field 
59. 

The constant "const" which is not appended with a 2s 
number represents a constant to be stored in the spe- 
cialized 32-brt register provided in the present processor 
(the constant register 36 shown in Fig. 4). As one ex- 
ample, for the instruction format with the format code 
"0", the 4-bit constant "const" located in the P1 .0 field 30 
52 implies the constant that is to be stored in the con- 
stant register 36 which is implicitly indicated. 

Fig. 3 shows specific examples of the operations 
that can be indicated by the three kinds of operation 
code "cc", "op1", and "op2" given in Figs. 2B to 2D. 35 
These operations are described in detail below. 

The 4-bit operation code "cc" indicates one out of 
sixteen types of branch instruction. Each branch instruc- 
tion is specified as a branch condition and a branch for- 
mat. Examples of branch conditions include "equal to *o 
('eq')", "not equal to ('neq')", and "greater than ('gt')". The 
branch format can be a format where the stored value 
of the constant register 36 serves as the absolute ad- 
dress of the branch destination (denoted by having no 
"i" attached to the instruction mnemonic), or a format 
where the stored value of the constant register 36 
serves as a relative address (denoted by having V at- 
tached to the instruction mnemonic). As one example, 
the operation code "eq" represents an operation that 
branches to a destination indicated through absolute ad- so 
dressing when a preceding comparison finds the com- 
pared values to be equal, while the operation code "eqi" 
represents an operation that branches to a destination 
indicated through relative addressing when a preceding 
comparison finds the compared values to be equal. 55 

The 4-bit operand "op 1" can be used to indicate an 
arithmetic logic operation, such as any of an "add" (ad- 
dition), a "sub" (subtraction), a "mul" (multiplication), an 
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"and" (logical AND), or an "or" (logical OR), or an oper- 
ation that is an inter-register transfer, such as any of a 
"mov" (transfer of word (32 -bit) data), a "movh (transfer 
of halfworddata), or a "movb" (transfer of one byte data). 

The 4-bit operand "op2" can be used to indicate any 
of the arithmetic logic operations or inter-register trans- 
fers that can be indicated by the operand "op1 but can 
also be used to indicate a register-memory transfer op- 
eration such as an "Id" (load of one word data from mem- 
ory into registers) or an "sf (store of one word data into 
memory from registers). 

The characteristic features of the fields 51 , 52, 59, 
and 60 shown in Fig. 2A are described below. 

The PO O field 51 holds a 4-bit format code that 
specifies the format of the instruction 50. More specifi- 
cally, this PO.O field 51 specifies one of the sixteen in- 
struction formats shown in Figs. 2B to 2D. 

The P1 .0 field 52 is a field holds a constant or an 
operation code for a branch operation. When a constant 
is located in the P1 .0 field 52 (such as in the instructions 
with the format codes "0", "1", and "4" to "9") there are 
cases where the constant is to be stored in the constant 
register 36 (such as in the instructions with the format 
codes "0", "1", "4", and "5"), and cases where the con- 
stant forms one part of the operand in the first operation 
field 59 or the second operation field 60 (such as in the 
instructions with the format codes "5", "7", "8", "9", and 
"B"). When the constant in the P1.0 field 52 is to be 
stored in the constant register 36, there are cases where 
only this 4-bit constant is stored (such as in the instruc- 
tions with the format codes "0" and "1"), and cases 
where this constant is stored together with a 12-bit con- 
stant located in either the first operation field 59 or the 
second operation field 60 (such as in the instructions 
with the format codes "4" and "5"). 

When the operation code "cc" for branching is given 
in the P1 .0 field 52 (such as in the instructions with the 
format codes "2", "3", and "A"), this indicates a branch 
operation that uses the stored value of the constant reg- 
ister 36 as the absolute address or relative address (dis- 
placement) of a branch destination. 

The first operation field 59 holds either a constant 
or a combination of (a) an operation code for indicating 
an operation (such as an arithmetic logic operation or 
inter-register transfer) that does not involve data trans- 
fer between the present processor and the periphery 
(memory), and (b) source and destination operands for 
the operation. 

The second operation field 60 can hold the same 
content as the first operation field 59 described above, 
but can also alternatively hold a combination of (a) an 
operation code for indicating an operation (such as 
memory-register transfer) that involves data transfer be- 
tween the present processor and the periphery and (b) 
operands for the operation. 

The above assignment of different operation types 
to certain fields rests on the premises for the present 
von Neumann-type processor whereby it is not neces- 
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sary to process two or more branch operations simulta- 
neously, and that only one input/output port (the oper- 
and access unit 40 shown in Fig. 4) for transferring op- 
erands is provided between the present processor and 
the periphery (memory). 

The instruction formats shown in Figs. 2B to 2D 
have the following characteristic features. 

First, by focusing on the constant "const", it can be 
seen that there are the following three types of instruc- 
tion for storing a constant in the constant register 36. 

(1) When the format code is a 0 m or "1 ": 

In these instructions, the 4-bit constant located 
in the P1 .0 field 52 is stored in the constant register 
36. 

(2) When the format code is "4": 

In this instruction, a 16-bit constant located in 
the P1 .0 field 52 to P2.2 field 55 is stored in the con- 
stant register 36. 

(3) When the format code is "5": 

In this instruction, a 16-bit constant located in 
the P1 .0 field 52 and the P3.0 field 56 to P3.2 field 
58 is stored in the constant register 36. 

Secondly, for the present processor, a maximum of 
three operations can be indicated by a single instruction, 
and in this case, as can be seen from the triple operation 
formats shown in Fig. 2B, either of the following combi- 
nations of operation types can be used. 

(1) One operation that sets a 4-bit constant into the 
constant register 36 and two standard operations 
(when the format code is "0" or "1 "). 

(2) One operation that performs branching using the 
value set in the constant register 36 as an absolute 
address or a relative address and two standard op- 
erations (when the format code ■2° or "3"). 

As described above, the instructions of present 
processor have a highly efficient field structure that en- 
ables a maximum of three operations to be simultane- 
ously indicated by a single 32-bit instruction. 

Hardware Construction of the Processor 

The hardware construction of the present processor 
is described below. 

Fig. 4 is a block diagram showing the hardware con- 
struction of the processor of the present invention. As 
described above, this processor is a VLIW processor 
that can execute a maximum of three operations in par- 
allel. The construction of the processor can be roughly 
divided into an instruction register 10, a decoder unit 20, 
and an execution unit 30. 

The instruction register 10 is a 32 -bit register that 
stores one instruction that has been sent from the in- 
struction fetch unit 39. 

The decoder unit 20 decodes the instruction held in 


the instruction register 10 and performs output on con- 
trol lines to the execution unit 30 in accordance with the 
decoding result. This decoder unit 20 can itself be 
roughly divided into the format decoder 21 and the in- 
5 struction decoder 22. 

The instruction decoder 22 is composed of a branch 
decoder 23 that decodes the "cc" operation code held 
in the P1 .0 field 12 and controls the PC unit 33 accord- 
ingly, a first operation decoder 24 that decodes the op- 
10 eration code held in the P2.0 field 1 3 and controls the 
first operation unit 37 accordingly, and a second opera- 
tion decoder 25 that decodes the operation code held 
in the P3.0 field 16 and controls the second operation 
unit 38 and operand access unit 40 accordingly. 
is The format decoder 21 decodes the 4-bit format 
code held in the PO O field 11 to identify the instruction 
format of the instruction held in the instruction register 
10 as one of the sixteen possible instruction formats 
shown in Figs. 2B to 2D. In accordance with the decod- 
20 ing result, the format decoder 21 permits or prohibits de- 
coding operations by the branch decoder 23, the first 
operation decoder 24, and the second operation decod- 
er 25, and activates the register control unit 32 of the 
execution unit 30. 
25 The format decoder 21 , the branch decoder 23, the 
first operation decoder 24, and the second operation de- 
coder 25 fundamentally decode one operation in one cy- 
cle and send control signals to the execution unit 30. 
Here, the 26-bit constant signal line 26 that connects the 
30 instruction register 1 0 with the execution unit 30 is a bus 
for transferring constants and operands located in the 
instruction register 10 to the execution unit 30. 

The execution unit 30 operates according to the de- 
coding result of the decoder unit 20 and is a circuit that 
35 js capable of executing a maximum of three operations 
in parallel. This execution unit 30 is composed of an ex- 
ecution control unit 31, a PC unit 33, a register set 34, 
a first operation unit 37, a second operation unit 38, an 
instruction fetch unit 39, and an operand access unit 40. 
40 Out of the components in the execution unit 30, the con- 
structions of the register control unit 32, the PC unit 33, 
and the constant register 36 are shown in greater detail 
in the other drawings. 

The execution control unit 31 refers in general to 
45 the control circuits and wiring for controlling the compo- 
nents numbered 33 to 40 in the execution unit 30 ac- 
cording to the decoding result of the decoder unit 20. 
This execution control unit 31 includes the components 
that are normally provided in a processor, such as cir- 
50 cuits for timing control, operation permission/prohibition 
control, status management, and interruption control, as 
well as the constant register control unit 32 which is a 
characteristic component of the present processor. The 
constant register control unit 32 performs control so that 
55 a 4- or 1 6-bit constant 'const' held in the instruction reg- 
ister 10 is stored in the constant register 36 based on 
indications given by the format decoder 21 . 

The PC (Program Counter) unit 33 operates under 
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the control of the branch decoder 23, and outputs the 
address in an external memory (not illustrated) of the 
next instruction to be decoded and executed to the in- 
struction fetch unit 39. 

The instruction fetch unit 39 fetches an instruction 
block from the external memory (not illustrated) via a 
32-bit I A (Instruction Address) bus and a 32-bit ID (In- 
struction Data) bus. The instruction fetch unit 39 stores 
the fetched instruction block in an internal instruction 
cache and supplies the instruction which corresponds 
to the address outputted by the PC unit 33 to the instruc- 
tion register 10. 

The register set 34 is composed of fifteen 32-bit 
general registers 35 and one 32-bit constant register 36. 
In accordance with the decoding results of the first op- 
eration decoder 24 and the second operation decoder 
25, the values which are stored in these sixteen regis- 
ters 35 and 36 are transferred to the first operation unit 
37 and the second operation unit 38 where an operation 
is performed or alternatively the values are allowed to 
pass, before being sent to the register set 34 or the op- 
erand access unit 40. Here, in addition to being used in 
the operations performed by the first operation unit 37 
and the second operation unit 38, the value stored in the 
constant register 36 can also be transferred to the PC 
unit 33, where it is used to generate an effective address 
that is used as a branch destination. 

The first operation unit 37 internally includes an 
ALU (Arithmetic Logic Unit) for performing arithmetic 
logic operations on two 32-bit sets of data and a multi- 
plier for performing multiplications on two 32-brt sets of 
data. This first operation unit 37 is capable of executing 
two types of operation (namely, arithmetic logic opera- 
tions, and inter-register transfer operations) under the 
control of the first operation decoder 24. 

The second operation unit 38 internally includes an 
ALU for performing arithmetic logic operations on two 
32-bit sets of data and a multiplier for performing multi- 
plications on two 32-bit sets of data, in the same way as 
the first operation unit 37. This second operation unit 38 
is capable of executing two types of operation (namely, 
arithmetic logic operations, and inter-register transfer 
operations) under the control of the second operation 
decoder 25. 

The operand access unit 40 operates under the 
control of the second operation decoder 25 and is a cir- 
cuit that transfers operands between the register set 34 
and the external memory (not illustrated). The operand 
access unit 40 internally includes a buffer for storing op- 
erands and operand addresses. As a specific example, 
when the operation code "Id" is in the P3.1 field 16 of 
the instruction register 10, one word of data that is lo- 
cated in the external memory is loaded via the operand 
access unit 40 into one of the registers in the register 
set 34. When the operation code "st" is present, mean- 
while, the stored value of one of the registers in the reg- 
ister set 34 is stored in the external memory. 

The PC unit 33, the register set 34, the first opera- 


tion unit 37, the second operation unit 38, and the oper- 
and access unit 40 are connected by internal buses (the 
L1 bus, the R1 bus, the L2 bus, the R2 bus, the D1 bus, 
and the D2 bus) as shown in Fig. 4. Here, the L1 bus 

s and the R1 bus are each connected a respective one of 
the two input ports of the first operation unit 37, the L2 
bus and the R2 bus are each connected to a respective 
one of the two input ports of the second operation unit 
38, and the D1 bus and the D2 bus are respectively con- 

10' nected to an output port of the first operation unit 37 and 
the second operation unit 38. 

Detailed Construction of the Constant Register 36 and 
its Periphery 

15 

The following is a detailed description of the con- 
struction of the constant register 36 and of the peripheral 
circuits. 

Fig. 5 is a block diagram showing the detailed con- 

20 struct ion of the constant register 36 and of the peripheral 
circuits. Note here that the fixed value ( m 0 m ) 27 in the 
drawings refers to fixed wiring for four signal lines car- 
rying the constant "0". 

The constant register control unit 32 is composed 

25 of five 3-input selectors 32a-32e and three 4-input se- 
lectors 32f-32h, while the constant register 36 is com- 
posed of eight 4-bit registers 36a-36h. Here, each set 
of input and output data is 4-bit parallel data. 

In accordance with control signals from the format 

30 decoder 21 and the instruction decoder 22, the constant 
register control unit 32 controls the eight input selectors 
32a-32h so that a constant stored in the instruction reg- 
ister 10 or zeros are stored in the constant register 36 
according to one of the four storage methods given be- 

35 low. 

Figs. 6 A to 6 D show the four possible storage meth- 
ods in the present embodiment. 

Fig. 6A shows a storage method for when the format 
decoder 21 detects that the value stored in the PO.O field 

<o 11 is "0" or "1". This equates to the case when only a 
4-bit constant located in the P1 .0 field 1 2 is stored in the 
constant register 36. More specifically, the data that is 
stored in the constant register 36 is shifted upwards (to 
the left in Fig. 6A) in 4-bit units and the 4-bit constant 

45 stored in the P1 .0 field 1 2 of the instruction register 10 
is stored in the lowest-order 4-bit register 36h of the con- 
stant register 36. 

Fig. 6B shows a storage method for when the format 
decoder 21 detects that the value stored in the PO.O field 

so 11 is "4 D . This equates to the case when a 16-bit con- 
stant located between the P1.0 field 12 and the P2.2 
field 1 5 is stored in the constant register 36. More spe- 
cifically, the data that is stored in the lower 16 bits 36e- 
36h of the constant register 36 is shifted to the upper 1 6 

55 bits 36a-36d and the 16-bit constant located between 
the P1 .0 field 1 2 and the P2.2 field 1 5 of the instruction 
register 10 is stored in the lowest-order 16-bits 36e-36h 
of the constant register 36. 
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Fig. 6C shows a storage method for when the format 
decoder 21 detects that the value stored in the PO.O field 
11 is "5°. This equates to the case when a 16-bit con- 
stant located in the P1 .0 field 12 and between the P3.0 
field 16 and the P3.2 field 18 is stored in the constant 
register 36. More specifically, the data that is stored in 
the lower 16 bits 36e-36h of the constant register 36 is 
shifted to the upper 16 bits 36a-36d and the 16-bit con- 
stant located in the P1 .0 field 12 and between the P3.0 
field 16 and the P3.2 field 18 of the instruction register 

10 is stored in the lowest-order 16-bits 36e-36h of the 
constant register 36. 

Fig. 6D shows a storage method for when the format 
decoder 21 detects that the value stored in the PO.O field 

11 is "2°, "3°, or "A", or when the instruction decoder 22 
detects that the constant register (R1 5) is indicated by 
at least one of the P2.1 field 14, the P2.2 field 15, the 
P3.2 field 17, and the P3.3 field 18. This equates to the 
case where the value stored in the constant register 36 
is reset to all zeros (which is to say, the constant register 
36 is cleared), after the stored value of the constant reg- 
ister 36 has been used by at least one of a branch op- 
eration located in the P1.0 field 12, an operation in the 
first operation field 59 or an operation in the second op- 
eration field 60. More specifically, immediately after the 
stored value of the constant register 36 has been read 
out to one of the PC unit 33, the first operation unit 37 
or the second operation unit 38, a 32-brt constant with 
the value "0" is written into the constant register 36. 

Here, the value in the constant register 36 is cleared 
after being used to ensure that a value with a zero ex- 
tension is always stored in the constant register 36. A 
zero extension here refers to the insertion of zeros that 
is performed when the effective number of bits of a value 
is below a predetermined number of bits, with zeros be- 
ing inserted into the higher bit positions so that the value 
takes up the predetermined number of bits. 

As described above, when the value in the PO.O field 
1 1 of the instruction register 1 0 is "0", "1 "4", or "S", the 
constant that is already stored in the constant register 
36 is shifted and a new value is stored. Also, after the 
value stored in the constant register 36 is read out and 
used, this stored value is deleted. By doing so, the con- 
stant register 36 is able to successively accumulate con- 
stants until the next time its stored content is used. 

Detailed Construction of the PC Unit 33 

The following is a detailed description of the con- 
struction of the PC unit 33. 

Fig. 7 is a block diagram showing the construction 
of the PC unit 33 in detail. As shown in Fig. 7, the PC 
unit 33 is composed of a fixed value ("4") 33a, that is 
wiring which permanently carries the constant "4", a 
2-input selector 33b, an adder 33c, a PC (Program 
Counter) 33d for storing an address of the next instruc- 
tion to be decoded and executed, and a 4-input selector 
33e. 


In the PC unit 33, the selectors 33b and 33e operate 
in accordance with control signals from the decoder unit 
20, so that the selector 33e outputs one of the following 
three types of values to the instruction fetch unit 39 as 
5 the effective address. 

1 . A value where '4° is added to the content of the PC 
33d 

10 This corresponds to when no branch is taken and a 
next instruction is to be executed in order, which is to 
say, when the decoding result for a present instruction 
is that no branch operation is indicated. The reason "4" 
is added is that the length of one instruction is four bytes, 
which is to say, 32 bits. 

2. A value where the content of the constant register 36 
is added to the content of the PC 33d 

This corresponds to when the content of the con- 
stant register 36 is used as a relative address for branch- 
ing, such as when the decoding result of the branch de- 
coder 23 is that the P1 .0 field 12 indicates a branch to 
a relative address. 

3. A value given as the content of the constant register 
36 

This corresponds to when the content of the con- 
stant register 36 is used as an absolute address for 
branching, such as when the decoding result of the 
branch decoder 23 is that the P1 .0 field 12 indicates a 
branch to an absolute address. 

As described above, the PC unit 33 includes a spe- 
cialized adder 33c, and is constructed to directly use the 
value stored by the constant register 36, so that branch 
execution control can be performed with the stored val- 
ue of the constant register 36 as a relative address or 
an absolute address in parallel with and independent of 
the operations performed by the first operation unit 37 
and the second operation unit 38. 

Operation of the Processor 

The following is a description of the operation of the 
present processor when decoding and executing spe- 
cific operations. 

Fig. 8 is a flowchart showing an example of a pro- 
cedure that handles 32-brt constants. First, the differ- 
ence between the stored values of the registers R0 and 
R1 is found (step S80), and the result is multiplied by 
the stored value of R2 (step S81). The 32-bit constant 
"0x87654321 B (the value "87654321 " in hexadecimal) is 
then added to the result of this (steps S82, S83), and 
finally the register R0 is cleared (step S84). 

Fig. 9 shows an example of a program that has the 
present processor perform the procedure shown in Fig. 
8. The program is composed of the three instructions 
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71-73. In Fig. 9, one line corresponds to one instruction, 
and the content of each instruction is shown by mne- 
monics located in the separate fields of each instruction. 
In Fig. 9, the value of each constant is expressed in hex- 
adecimal. Also, the legend fmtn (n=0-F)" shows the for- 
mat code "n", while the legend "Rn (n=0-1 5) m shows the 
value stored in one of the registers in the register set 
34. Of these, "R15" refers to the constant register 36. 

Fig. 10 is a timing chart showing the operation of 
the present processor when executing the program 
shown in Fig. 9. This Fig. 1 0 shows the clock cycles, the 
content of the general registers R0-R3 and the register 
R15, and the data that flows on the four buses L1 , R1 , 
L2, and R2. 

The following is an explanation of the operation of 
the present processor for each of the instructions 71 to 
73, with reference to Figs. 9 and 10. 


11 in the instruction register 10 that the present instruc- 
tion is a twin operation instruction with the format code 
"4", and so controls the execution unit 30 so that the two 
operations described below are executed in parallel. 

1. First Operation 

The constant register control unit 32 controls its 
eight internal selectors 32a-32h so that the 16-bit con- 
stant (0x4321) located between the P1.0 field 12 to the 
P2.2 field 15 is stored in the lower 16 bits of the constant 
register 36 according to the storage method shown in 
Fig. 6B. Accordingly, the content of the register R15 
changes from "0x00008765 * to "0x87654321 " as shown 
in the clock cycles t1-t2 in Fig. 10. 

2. Second Operation 


10 


Instruction 71 

After the instruction 71 has been loaded into the in- 
struction register 1 0, the present processor performs the 
operations shown in the clock cycles tO-tl in Fig. 10. The 
format decoder 21 judges from the value "fmt4" of the 
PO.O field 11 in the instruction register 10 that the 
present instruction is a twin operation instruction with 
the format code '4*, and so controls the execution unit 
30 so that the two operations described below are exe- 
cuted in parallel. 

1 . First Operation 

The constant register control unit 32 controls its 
eight internal selectors 32a-32h so that the 16-bit con- 
stant (0x8765) located between the P1.0 field 12 to the 
P2.2 field 1 5 is stored in the lower 16 bits of the constant 
register 36 according to the storage method shown in 
Fig. 6B. Accordingly, the content of the register R15 
changes from "0x00000000" to "0x00008765" as shown 
in the clock cycles tO-tl in Fig. 10. 

2. Second Operation 

The second operation unit 38 receives an input of 
the stored value "0x33333333° of the general register 
R0 and the stored value "0x22222222" of the general 
register R1, and after subtracting the latter from the 
former, stores the result in the general register R0. As 
a result, the stored content of the general register R0 
changes from the value "0x33333333" to the value 
"Oxllllllll" in the clock cycles t0-t1 shown in Fig. 10. 

Instruction 72 

Next, after the instruction 72 has been loaded into 
the instruction register 10, the present processor oper- 
ates as shown in clock cycles t1-t2 in Fig. 10. The format 
decoder 21 judges from the value "fmt4" of the PO.O field 


The second operation unit 38 receives an input of 
20 the stored value "0x00000004" of the general register 
R2 and the stored value "0x1 1111111' of the general reg- 
ister R0, and multiplies the two together before storing 
the result in the general register R0. As a result, the 
stored content of the general register R0 changes from 
25 the value "0x1 1 1 1 1 1 1 1" to the value "0x44444444" in the 
clock cycles t1-t2 shown in Fig. 10. 

Instruction 73 

30 Next, after the instruction 73 has been loaded into 
the instruction register 10, the present processor oper- 
ates as shown in clock cycles t2-t3 in Fig. 1 0. The format 
decoder 21 judges from the value "f mt7" of the PO.O field 
11 in the instruction register 10 that the present instruc- 
ts tion is a twin operation instruction with the format code 
"7", and so controls the execution unit 30 so that the two , 
operations described below are executed in parallel. 

1 . First Operation 

40 

The first operation unit 37 receives an input of the 
stored value "0x87654321" of the general register R15 
and the stored value "0x44444444" of the general reg- 
ister R0, and adds the two together before storing the 
45 result in the general register R0. As a result, the stored 
content of the general register R0 changes from the val- 
ue "0x44444444" to the value "0xCBA98765" in the 
clock cycles t2-t3 shown in Fig. 10. 


The second operation unit 38 receives an input of 
the 8-bit constant ("0x00") that is located in the P1 .0 field 
1 2 and the P3.1 field 17 and allows this constant to pass 
ss so that it is stored in the general register R3. As a result, 
the content of the general register R3 changes from the 
previously held value "0xFEDCBA98" to "0x00000000", 
as shown for the clock cycles t2-t3 in Fig. 10. 
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As described above for the present processor, the 
32-bit constant "0x87654321 " is split into two parts that 
are arranged into the two instructions 71 and 72, with 
these parts being successively stored in the constant 
register 36 by shifting its stored value. This stored con- 
stant is then used according to the third instruction, in- 
struction 73. By doing so, the procedure shown in the 
flowchart of Fig. 8 can be executed by the three instruc- 
tions 71-73. 

The following is an explanation of the operation of 
the present processor using a different program that 
deals with 16-bit constants. 

Fig. 11 shows an example of a program that handles 
a 16 bit constant This program is composed of the five 
instructions 74 to 78. 

The operation of the present processor for the each 
of the instructions 74 to 78 is as described below. 

Instruction 74 

When the instruction 74 has been loaded into the 
instruction register 10, the format decoder 21 judges 
from the value "frntO" of the PO.O field 11 in the instruc- 
tion register 10 that the present instruction is a triple op- 
eration instruction with the format code "0", and so con- 
trols the execution unit 30 so that the three operations 
described below are executed in parallel. 

1 . First Operation 

The constant register control unit 32 controls its 
eight internal selectors 32a-32h so that the 4-bit con- 
stant (■0x8") located in the P1 .0 field 12 is stored in the 
lowest 4 bits of the constant register 36 according to the 
storage method shown in Fig. 6A. 

2. Second Operation 

The first operation unit 37 receives an input of the 
stored value of the general register R6, and allows this 
value to pass so that it is stored in the general register 
R1. 

3. Third Operation 

In the same way, the second operation unit 38 re- 
ceives an input of the stored value of the general register 
R7, and allows this value to pass so that it is stored in 
the general register R2. 

Instruction 75 

When the instruction 75 has been loaded into the 
instruction register 10, the format decoder 21 judges 
from the value "frntO" of the PO.O field 11 in the instruc- 
tion register 1 0 that the present instruction is a triple op- 
eration instruction with the format code "0", and so con- 
trols the execution unit 30 so that the three operations 


described below are executed in parallel. 

1 . First Operation 

s The constant register control unit 32 controls its 
eight internal selectors 32a-32h so that the 4-bit con- 
stant ("0x7") located in the P1 .0 field 12 is stored in the 
lowest 4 bits of the constant register 36 according to the 
storage method shown in Fig. 6 A. After this operation, 
io the constant "0x87" is set in the lowest 8 bits of the con- 
stant register 36. 

2. Second Operation 

The first operation unit 37 receives an input of the 
stored values of the general register R0 and the general 
register R1, and adds these values together. The first 
operation unit 37 stores the addition result in the general 
register R1 . 

3. Third Operation 

In the same way, the second operation unit 38 re- 
ceives an input of the stored values of the general reg- 
ister R0 and the general register R2, and adds these 
values together. The second operation unit 38 stores the 
addition result in the general register R2. 

Instructions 76, 77 

Instructions 76 and 77 are executed in the same 
way as described above, and as a result the constant 
"0x8765" is stored in the lower 16 bits of the constant 
register 36. 

Instruction 78 

Once the instruction 78 has been loaded into the 
instruction register 10, the present processor operates 
in the same way as when processing instruction 73. 

As described above for the present processor, the 
16-bit constant B 0x8765" is split into four parts that are 
arranged into the instructions 74-77, with these parts be- 
ing successively stored in the constant register 36 by 
shifting its stored value. This stored constant is then 
used according to the fifth instruction, instruction 78. 

Comparison with a Standard Processor 

The following is a description of the processing per- 
formed by a standard processor for a program with the 
same processing content as shown in Figs. 9 and 11 
and a comparison with the processing of the present in- 
vention. Here, the expression "standard processor" re- 
fers to a processor that executes instructions whose 
word length is fixed at 32 bits, and is the same as the 
present processor, except for the lack of a construction, 
such as the constant register 36 and the constant reg- 
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ister control unit 32, for accumulating constants that 
have been divided between instructions. 

Fig. 1 2A shows the field definition of the instructions 
that are executed by a standard processor, while Fig. 
1 2B shows the format of the instructions. Here, it is sup- 
posed that the standard processor can execute three 
types of twin operation instruction, instructions 101 -103, 
and one type of single operation instruction, instruction 
104. 

Fig. 13 shows an example of a program to be exe- 
cuted by the standard processor. This program has the 
same processing content as the program shown in Fig. 
9, which is to say the same procedure as the flowchart 
shown in Fig. 8. 

As can be seen by comparing Fig. 13 and Fig. 9, 
the program for the standard processor includes two 
more instructions that the program for the processor of 
the present invention. 

The reason the "NOP codes* are included in the in- 
structions 105 and 106 is that the instruction 106 uses 
the operation result of the instruction 105, so that these 
instructions cannot be executed in parallel. Also, the 
reason the constant '0x87654321 " is divided into an up- 
per 1 6 bits and a lower 1 6 bits that are set in the constant 
register Ri (instructions 107 and 1 08) is that it is not pos- 
sible to set a 32 -bit constant and an operation code for 
a setting instruction in a single 32-bit instruction. 

Fig. 14 also shows an example of a program for a 
standard processor. This program has the same 
processing content as program shown in Fig. 11 . As can 
be seen by comparing Fig. 14 and Fig. 11, the program 
for the standard processor includes one more instruc- 
tion than the program for the processor of the present 
invention. 

As described above, the instructions executed by 
the processor of the present invention have a highly ef- 
ficient field structure whereby a maximum of three op- 
erations can be indicated using a comparatively short 
word length of 32 bits. 

Accordingly, with the processor of the present in- 
vention, a 1 6-bit or 32-bit constant that has been divided 
across a plurality of instructions can be accumulated in 
the constant register 36 to restore the constant to its 
original form, with it then being used for a branch oper- 
ation or arithmetic logic operation. 
Accordingly, when a small region is available in an in- 
struction, this region can be effectively used for locating 
a part of a constant, so that the code size of the program 
can be reduced compared with when the same process- 
ing is performed by a standard processor. 

Modification 

Figs. 15A to 15D show the instruction formats of 
VLIW processors that are modifications of the present 
invention. In these figures, the smallest interval demar- 
cated by the vertical lines represents one bit, while the 
legend "fmt - shows the format field. 


The instruction shown in Fig. 15A is composed of a 
5-bit format field, a 7-bit operation field, and two 10-bit 
operation fields. The instruction shown in Fig. 15B, 
meanwhile, is composed of a 2-bit format field, a 4-bit 
s operation field, and two 1 3-bit operation fields. The in- 
struction shown in Fig. 1 5C is composed of a 3-bit format 
field, a 3-bit operation field, and two 13-bit operation 
fields. Finally, the instruction shown in Fig. 15D is com- 
posed of a 4-bit format field, a 2-bit operation field, and 
w two 1 3-bit operation fields. 

These four types of instruction have the following 
similarities with the instruction 50 described in the above 
embodiment and shown in Fig. 2A. 

'5 j. The instruction word length is fixed at 32 bits. 

ii. Each instruction has one format field and three 
operation fields. 

iii. The three operation fields do not have the same 
structure, which is to say, two of the operation fields 

20 have the same structure, while the remaining oper- 
ation field is short. 

As a result, these four types of instruction have the 
same characteristics as the instruction 50 described in 
25 the above embodiment. 

I. The instructions have a field structure whereby a 
maximum of three operations can be indicated in 
spite of the comparatively short 32-bit word length. 

30 ||. A small operation field which is ideal for inserting 
small instructions, such as branch instructions that 
do not require two operands, is provided, so the 
code efficiency of instructions is high. 
III. A format field is provided so that by giving an 

35 indication that a constant or one part of a constant 
is present in an operation field where an NOP in- 
struction would normally need to be provided, the 
code size of a program can be reduced. 

40 On the other hand, the four types of instructions de- 
scribed above also have the following characteristics 
which differ from those of the instruction 50 described 
in the above embodiment. The instruction shown in Fig. 
15A has the advantages of an enlarged format field, 

45 meaning a greater number of instruction types can be 
defined, and of the ability to provide at least one operand 
in each of the three operation fields. The instructions 
shown in Fig. 1 5B to Fig. 1 5D, meanwhile, have the ad- 
vantage of two enlarged operation codes fop2" and 

50 p op3"), so that a greater variety of operations can be de- 
fined. 

Fig. 16 is a block diagram showing the construction 
of a VLIW processor of the present invention for execut- 
ing the instruction shown in Fig. 15A. As can be seen 
55 by comparing this construction with the construction 
shown in Fig. 4, the fundamental construction of the 
processor is the same, although there are some differ- 
ences in the connections between the instruction regis- 
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ters 10, 110 and the decoder units 20, 120. In this way, 
VLIW processors for executing the modified instructions 
shown in Figs. 15A to 15D can be realized by making 
only partial changes to the VLIW processor described 
in the above embodiment. 

The processor of the present invention has been ex- 
plained by means of the embodiment given above, al- 
though it should be obvious that several further modifi- 
cations are possible. Four examples of such are given 
below. 

(1) The embodiment and above modification of the 
present invention all deal with the case where the 
instruction word length is 32 bits and a maximum of 
three operations are indicated, although the present 
invention is not limited to these numbers. 

As one example, the instruction 50 shown in 
Fig. 2A may further include another 4-bit operation 
code and another 4-bit operand, making the total 
instruction word length 40 bits. By doing so, it be- 
comes possible to define instructions with high code 
efficiency whereby a maximum of four operations 
can be executed by a single instruction with the 
comparatively short word length of 40 bits. 

(2) The instruction 50 of the above embodiment only 
includes one field (the P1.0 field 52) that uses an 
implicit operand (the stored value of the constant 
register 36), although the present invention need 
not be limited to this, and so have two or more such 
fields. This can be handled by appropriately defin- 
ing a new instruction format. 

(3) In the above embodiment, an example dealing 
with a numerical constant is given, although it is of 
course equally possible for the invention to deal with 
a character constant. This is because a long char- 
acter constant that is divided across a plurality of 
instructions can be accumulated by successively 
storing different parts of the character constant in 
the constant register 36. 


art. Therefore, unless such changes and modifications 
depart from the scope of the present invention, they 
should be construed as being included therein. 

5 

Claims 

1 . A VLIW (Very Long Instruction Word) processor that 
decodes and executes an instruction that has at 

10 least two operation fields, of which a first operation 
field can only include one operation code for spec- 
ifying an operation type and a second operation 
field includes a combination of one operation code 
and at least one operand used in an operation indi- 

15 cated by the second operation field, 
the VLIW processor comprising: 

first decoding means for decoding the opera- 
tion code in the first operation field; 
20 first execution means for executing an opera- 

tion indicated by the operation code in the first 
operation field in accordance with a decoding 
result of the first decoding means; 
second decoding means for decoding the op- 
2S e ration code in the second operation field; and 

second execution means for executing the op- 
eration indicated by the operation code in the 
second operation field on data which is indicat- 
ed by the operands in the second operation 
30 field, in accordance with a decoding result of 

the second decoding means. 

2. The VLIW processor of Claim 1 , 

wherein a number of bits occupied by the first 
35 operation field in the instruction is less that a 
number of bits occupied by the second operation 
field. 


3. The VLIW processor of Claim 2, 
40 wherein a number of bits occupied by the op- 

eration code in the first operation field is equal to a 
number of bits occupied by the operation code in 
the second operation field. 

45 4. The VLIW processor of Claim 3, 

wherein the instruction includes three operation 
fields, 

wherein a third operation field in the three op- 
50 eration fields occupies a same number of bits 

as the second operation field and includes a 
combination of one operation code and at least 
one operand, 


25 


(4) As can be seen from the instruction formats 
shown in Figs. 2B to 2D of the above embodiment, 
only a 4-bit or a 1 6-bit constant can be stored in the 
constant register 36 in the above embodiment by a 
single instruction, although this is not a limitation for 
the present invention. As examples, it is equally 
possible to define an instruction format whereby a 
12-bit or a 28-bit constant can be stored in the con- 
stant register 36 by a single instruction. To do so, it 
is only necessary to change the connection pattern 
of the peripheral circuits of the constant register 36. 


Although the present invention has been fully de- 
scribed by way of examples with reference to accompa- 
nying drawings, it is to be noted that various changes 
and modifications will be apparent to those skilled in the 


55 the VLIW processor further comprising: 

third decoding means which decodes, when an 
operation code is present in the third operation 
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field, the operation code in the third operation 
field; and 

third executing means for executing an opera- 
tion indicated by the operation code in the third 
operation field on data which is indicated by the s 
operands in the third operation field, in accord- 
ance with a decoding result of the third decod- 
ing means. 

5. The VLIW processor of Claim 4, io 

wherein the first executing means controls a 
control flow of a program including the instruction. 

6. The VLIW processor of Claim 5, 

75 

wherein the second executing means controls 
transfer of the data that is indicated by the op- 
erands included in the second operation field, 
and 

the third executing means controls executes an 20 
arithmetic logic operation on the data that is in- 
dicated by the operands included in the third 
operation field. 

7. A VLIW processor that decodes and executes an 2s 
instruction that has at least two operation fields, of 
which 

a first operation field can only include one of (i) 
a single operation code for specifying an oper- 30 
ation type and (ii) a constant, and 
a second operation field includes one of (i) a 
combination of one operation code and at least 
one operand used in an operation indicated by 
the second operation field and (ii) a constant, 35 

the VLIW processor comprising: 

first decoding means which decodes, when an 
operation code is present in the first operation 40 
field, the operation code in the first operation 
field; 

first executing means for executing an opera- 
tion indicated by the operation code in the first 
operation field, in accordance with a decoding 45 
result of the first decoding means; 
second decoding means which decodes, when 
an operation code is present in the second op- 
eration field, the operation code in the second 
operation field; and 50 
second executing means for executing an op- 
eration indicated by the operation code in the 
second operation field on data which is indicat- 
ed by the operands in the second operation 
field, in accordance with a decoding result of ss 
the second decoding means. 

8. The VLIW processor of Claim 7, 
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wherein the instruction also includes a format 
field including a format code indicating whether only 
a constant is located in the first operation field and 
whether only a constant is located in the second op- 
eration field, 

the VLIW processor further comprising: 

format decoding means for decoding the format 
code; and 

constant storage means for extracting, when a 
decoding result of the format decoding means 
shows that only a constant is present in at least 
one of the first operation field and the second 
operation field, the constant in the instruction 
and storing the extracted constant. 

9. The VLIW processor of Claim 8, wherein 
wherein a number of bits occupied by the first 

operation field in the instruction is less that a 
number of bits occupied by the second operation 
field. 

10. The VLIW processor of Claim 9, 
wherein a number of bits occupied by the op- 
eration code in the first operation field is equal to a 
number of bits occupied by the operation code in 
the second operation field. 

11. The VLIW processor of Claim 10, 

wherein the instruction includes three operation 
fields, 

wherein a third operation field in the three op- 
eration fields occupies a same number of bits 
as the second operation field and includes a 
combination of one operation code and at least 
one operand, 

the VLIW processor further comprising: 

third decoding means which decodes, when an 
operation code is present in the third operation 
field, the operation code in the third operation 
field; and 

third executing means for executing an opera- 
tion indicated by the operation code in the third 
operation field on data which is indicated by the 
operands in the third operation field, in accord- 
ance with a decoding result of the third decod- 
ing means. 

12. The VLIW processor of Claim 11 , 
wherein the first executing means controls a 

control flow of a program including the instruction. 

13. The VLIW processor of Claim 12, 
wherein the second executing means controls 
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transfer of the data that is indicated by the op- 
erands included in the second operation field, 
and 

the third executing means controls executes an 
arithmetic logic operation on the data that is in- 
dicated by the operands included in the third 
operation field. 

14. The VLIW processor of Claim 13, 

wherein the format field, the first operation 
field, the operation code in the second operation 
field, each operand in the second operation field, 
the operation code in the third operation field, and 
each operand in the third operation field each occu- 
py n bits in the instruction. 

15. The VLIW processor of Claim 14, 

wherein the instruction is 32 bits long and n is 

4. 

16. A VLIW processor that decodes and executes an 
instruction that has at least three operation fields, 
of which 

a first operation field includes one operation 
code for controlling a control flow of a program 
including the instruction, 
a second operation field includes an operation 
code for controlling a data transfer, and 
a third operation field includes an operation 
code for controlling an arithmetic logic opera- 
tion on data, 

the VLIW processor comprising: 

first decoding means for decoding the opera- 
tion code in the first operation field; 
first execution means for controlling the control 
flow of the program as indicated by the opera- 
tion code in the first operation field, in accord- 
ance with a decoding result of the first decoding 
means; 

second decoding means for decoding the op- 
eration code in the second operation field; 
second execution means for controlling the da- 
ta transfer indicated by the operation code in 
the second operation field, in accordance with 
a decoding result of the second decoding 
means; 

third decoding means for decoding the opera- 
tion code in the third operation field; and 
third execution means for executing the arith- 
metic logic operation indicated by the operation 
code in the third operation field, in accordance 
with a decoding result of the third decoding 
means. 

17. The VLIW processor of Claim 16, 


wherein a number of bits occupied by the first 
operation field in the instruction is less that a 
number of bits occupied by the second operation 
field and the number of bits occupied by the third 
5 operation field. 

18. The VLIW processor of Claim 17, 

wherein a number of bits occupied by the op- 
eration code in the first operation field is equal to a 
10 number of bits occupied by the operation code in 
the second operation field and a number of bits oc- 
cupied by the operation code in the third operation 
field. 

15 19. The VLIW processor of Claim 17, 

wherein the second operation field occupies 
a same number of bits as the third operation field. 

20. The VLIW processor of Claim 17, 

20 wherein the instruction is 32 bits long. 

21. A VLIW processor that decodes and executes an 
instruction that has a format field and at least three 
operation fields, of which 

25 

a first operation field can only include one of (i) 
an operation code for controlling a control flow 
of a program including the instruction and (ii) a 
constant, 

30 a second operation field includes one of (i) an 

operation code for controlling a transfer of data 
and (ii) a constant, and 

a third operation field includes one of (i) an op- 
eration code for controlling an arithmetic logic 

35 operation on data and (ii) a constant, 

the format field including a format code that in- 
dicates whether a constant is located in the first 
operation field, whether a constant is located in 
the second operation field, and whether a con- 

40 stant is located in the third operation field; 

the VLIW processor comprising: 

first decoding means for decoding the opera- 
45 tion code in the first operation field; 

first execution means for executing a control of 
the control flow of the program as indicated by 
the operation code in the first operation field, in 
accordance with a decoding result of the first 
so decoding means; 

second decoding means for decoding the op- 
eration code in the second operation field; 
second execution means for executing a con- 
trol of the data transfer indicated by the opera- 
55 tion code in the second operation field, in ac- 

cordance with a decoding result of the second 
decoding means; 

third decoding means for decoding the opera- 
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tion code in the third operation field; 
third execution means for executing the arith- 
metic logic operation indicated by the operation 
code in the first operation field, in accordance 
with a decoding result of the third decoding s 
means; 

format decoding means for decoding the format 
code; and 

constant storage means for extracting, when a 
decoding result of the format decoding means io 
shows that a constant is present in at least one 
of the first operation field, the second operation 
field, and the third operation field, the constant 
in the instruction and storing the extracted con- 
stant. 15 

22. The VLIW processor of Claim 21 , 

wherein the instruction is 32 bits long. 

23. A VLIW processor, comprising: 20 

fetch means for fetching an L-bit instruction that 
includes n operation fields; and 
n operation units which are each associated 
with a different one of the n operation fields in 25 
the fetched instruction and each independently 
execute an operation indicated in the associat- 
ed operation field in parallel with each other; 
the VLIW processor being characterized by the 
^operation fields not all being a same size, and 30 
by L not being an integer multiple of n. 

24. The VLIW processor of Claim 23, 

wherein at least one of the n operation fields 
has a different length to other operation fields in the 35 
n operation fields. 

25. The VLIW processor of Claim 24, 

wherein n is 3 and L is 32. 

40 

26. The VLIW processor of Claim 23, 

wherein a number of operands included in at 
least one operation field out of the noperation fields 
is different to a number of operands in other oper- 
ation fields in the n operation fields. *s 

27. The VLIW processor of Claim 26, 

wherein n is 3 and L is 32. 

28. The VLIW processor of Claim 23, so 

wherein the n operation fields include at least 
one operation field composed of only an operation 
code and at least one operation field composed of 
an operation code and at least one operand. 

55 

29. The VLIW processor of Claim 28, 

wherein n is 3 and L is 32. 
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